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ABSTRACT 

Subhalo abundance matching (a.k.a. SHAM) is a technique for populating simulated 
■ dark matter distributions with galaxies, assuming a monotonic relation between a 

galaxy's stellar mass or luminosity and the mass of its parent dark matter halo or 
subhalo. We examine the accuracy of SHAM in two cosmological smoothed particle 
hydrodynamics (SPH) simulations, one of which includes momentum— driven galactic 
winds. The SPH simulations indeed show a nearly monotonic relation between stellar 
mass and halo mass provided that, for satellite galaxies, we use the mass of the subhalo 
at the epoch Zsat when it became a satellite. In each simulation, the median relation 
for central and satellite galaxies is nearly identical, though a somewhat larger fraction 
of satellites are outliers because of stellar mass loss. SHAM-assigned masses (at z = 
— 2), luminosities (i?-band at z — Q), or star formation rates (at z — 2) have a 68% 
scatter of 0.09 — 0.15 dex relative to the true simulation values. When we apply SHAM 
to the subhalo population of a coUisionless N-body simulation with the same initial 
conditions as the SPH runs, we find generally good agreement for the halo occupation 
distributions and halo radial profiles of galaxy samples defined by thresholds in stellar 
mass. However, because a small fraction of SPH galaxies suffer severe stellar mass loss 
after becoming satellites, SHAM slightly overpopulates high mass halos; this effect 
is more significant for the wind simulation, which produces galaxies that are less 
massive and more fragile. SHAM recovers the two-point correlation function of the 
SPH galaxies in the no- wind simulation to better than 10% at scales 0.1/i~^Mpc < r < 
10/i~^Mpc. For the wind simulation, agreement is better than 15% at r > 2/i~^Mpc, 
but overpopulation of massive halos increases the correlation function by a factor 
~ 2.5 on small scales. The discrepancy in the wind simulation is greatly reduced if 
we raise the stellar mass threshold from GxIO^Mq to SxlO^^M©; in this case SHAM 
overpredicts the SPH galaxy correlation function by ~20% at r < l/i~^Mpc but agrees 
well with SPH clustering at larger scales. 

Key words: galaxies: evolution — galaxies: formation — models: semi-analytic — 
models: numerical 



1 INTRODUCTION 

In the standard theoretical description of galaxy formation, 
galaxies form by the dissipation of the baryo nic component 
within coUisionless dark ma tter halos (e.g. IWhite fc Reed 
1 19781 : iFall fc Efstathiodll98d ) . When a dark matter halo en- 
ters the virial radius of a more massive halo, it is subjected 



to tidal stripping and potentially to disruption. Nonethe- 
less, high resolution N-body simulations have indicated that 
mass ive halos retain a substantial amount of substructur e 
(Klv pin et al]|l999l : iMoore et al1ll999l : ISpringel et alll200ll ') , 
consisting of bound dark matter clumps orbiting within the 
potential of their host halo. Evidently, such subhalos were 
themselves independent, self-contained halos in the past, be- 
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fore merging with a more massive halo. If sufficiently mas- 
sive, these subhalos were sites of baryon dissipation and 
star formation in the past. There are many indications from 
studies of the statistical properties of how gala:xies and sub- 
structures populate halos that galaxies in groups and clus- 
ters are in fa ct the observation al co unterparts of su b halos . 
For example, IColm et al. I 11999') and 'Kravtsov et al.l 
show that the correlation functions of substructures in high 
resoution N-body simulations are in good agreement with 
the obse rved correlation functions of galaxies. On the the- 
ory side, iKravtsov et al.l (|2004l ) find that the distribution of 
subhalos in high resolution N-body simulations is similar to 
that of smoothed pa rticle hydrodynamics (S PH) galaxies in 
iBerlind et all l|2003l ) and lZheng et all (|2005l '). 

Subhalo abundance matching (SHAM) is a technique 
for assigning observable galaxy properties to a halo/subhalo 
population in an N-body simulation. It is based on as- 
suming a monotonic relationship between observable prop- 
erties of galaxies and dynamical properties of dark mat- 
ter substructures. As subhalos that fall into the virial ra- 
dius of more massive halos are subject ed to stripping and 
tidal disruption, several authors (e.g. Conrov et al.l 120061 : 
IVale fc Qstrikeill2006l : iMoster et al.ll2010[ l contend that the 
properties of satellite galaxies should be better correlated 
with the properties of subhalos at the time of their accretion 
onto a more massive halo rather than their present day prop- 
erties. By using the SHAM techniq ue, with this accretion- 
epoch matching. IConrov et al.l l|2006l ) match the observed lu- 
minosity dependence of galaxy clustering at a wide range of 
epochs, ranging from z = to 2 ~ 5. Assuming a monotonic 
relationship between galax y mass and halo m ass and using 
the same SHAM technique, IGuo et al] (|201G| ) reproduce the 
observationally inferred relation between stellar mass and 
halo virial mass. Trujillo-Gomez et al., (201Qf) use the abun- 
dance matching technique to match the observed relations 
between stellar mass and circular velocity and luminosity 
and circular velocity, and to match the estimated galaxy 
velocity function. 

In this paper, we investigate the effectiveness of subhalo 
abundance matching (SHAM) in cosmological SPH simula- 
tions, where we know exactly the relation between the prop- 
erties of galaxies and the masses of their parent halos and 
subhalos. We investigate the degree to which there is a di- 
rect correspondence between the properties of dark matter 
substructures in a dissipationless numerical simulation of a 
cosmological volume and condensed baryons in numerical 
simulations (with the same initial conditions) that include 
a dissipative compon ent. We extend the similar study of 
[Weinberg et al.l (|2008l ) in several ways. Firstly, our simula- 
tion volume is more than ten times larger. Secondly, one of 
our dissipative simulations includes ejective feedback in the 
form of momentum driven winds, which curtail star forma- 
tion and produce a stellar mass function that is in better 
agreement with the observations. Lastly and perhaps most 
importantly, we relate galax;y properties to the subhalo mass 
at the epoch of accretion rather than at the present time (see 
comparison in §4) . We make a direct assessment of the ef- 
fectiveness of SHAM as a method for assigning stellar mass, 
luminosity or star formation rate to subhalos and investigate 
the sources of its breakdown. 

Our investigation offers insight into the physical mech- 
anisms that shape galaxy masses and luminosities in these 



simulations, and it also has practical import. If the sub- 
halo population in the purely gravitational simulation does 
indeed trace the observable properties of the galaxy popula- 
tion, it enables us to make observable predictions of quan- 
tities like the luminosity dependence of galaxy clustering 
based on computationally less expensive N-body simulations 
instead of hydrodynamic simulations. SHAM also offers a 
relatively inexpensive tool for creating artificial galaxy cata- 
logues to support statistical analyses of large scale structure 
data sets. 

In §2, we describe our simulation, our methods for iden- 
tifying halos, subhalos and galaxies, and the subhalo abun- 
dance matching scheme. In §3, we investigate the relation- 
ship between the stellar masses, luminosities, and star for- 
mation rates of SPH galaxies and dark matter substructures 
in our SPH simulations to test some of the underlying as- 
sumptions of the subhalo abundance matching technique. In 
§4, we investigate whether the SPH galaxy population can 
be recovered from the subhalo population in our matched 
N-body simulation; note that it is this comparison rather 
than the investigations in §3 that tests SHAM as it has been 
traditionally implemented. Finally, in §5, we summarise our 
results and discuss their implications. 



2 METHODS 
2.1 Simulations 

Our simulations ar e performed u sing the GADGET-2 
code llSpringej|2005l ) as modified by lOppenheimer fc Pavel 
l|2008t ). Gravitational forces are calcu lated using a combina- 
tion of the Particle Mesh algorithm (|Hocknev fc Eastwood! 
198]]) for large distances and the hierarchic al tree al- 
gorithm (|Barnes fc Hull Il986l : iHernguistl Il987l ) for short 
distances. The smoothed particle hydrodynamics (SPH) 
algorithm is entropy and en er gy c onserving and is 
based on ISpringel fc Hernguistl l|2002r) . The details of 
the treatment of radiative cooling can be found in 
iKatz. Weinberg fc Hernguistl (|l996l) . Gas can dissipate en- 
ergy via Compton cooling and radiative cooling, com- 
puted assuming a primo rdial gas composit i on an d a back- 
ground UV flux based on lHaardt fc Madaul (120011 ). The de- 
tails of the treatment of s tar formation can be found in 
ISpringel fc Hernguistl (|2003h . Briefly, each gas particle satis- 
fying a temperature and density criterion is assigned a star 
formation rate, but the conversion of gaseous material to 
stellar material proceeds stochastically. The parameters for 
the star formation model are selected so as to match the 
z = relation between star form ation rate and gas density 
(|Kennicuttlll99i : ISchmidt|[l959l) . 

We adopt a ACDM cosmology (inflationary cold dark 
matter with a cosmological constant) with Q.m ~ 0.25, JIa ~ 
0.75, h = i/o/100 km s"^Mpc"^=0.7, fit = 0.044, spectral 
index Us = 0.95, and the amplitude of the mass fluctuations 
scaled to ag ~ 0.8. These values are reasonably close to 
current estimates f rom the cosmic microwave background 
llLarson et al.l I2OI0I ) and large scale structure (iReid et al.l 
I2OI0I ). We do not expect minor changes in the values of the 
cosmological parameters to affect our conclusions. 

We follow the evolution of 288^ dark-matter particles 
and 288^ gas particles, i.e. just under 50 million particles in 
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total, in a comoving box that is 50^ ^ Mpc on each side, 
from z = 129 to 2: = 0. The dark matter particle mass 
is 4.3 X 10* Mq, and the SPH particle mass is 9.1 x lO'^ 
Mq. The gravitational force softening is a comoving 5h~^ 
kpc cubic spline, which is roughly equivalent to a Plummer 
force softening of S.5h~^ kpc. 

One of our simulations, SPHw (SPH winds) in- 
corporates kinetic feedb ack through momentum d riven 
winds as implemented bv lOppenheimer fc Pavel (|2006h and 
lOppenheimer fc Pavel (|200g ) where the details of the im- 
plementation can be found. Briefly, wind velocity is propor- 
tional to the velocity dispersion of the galactic halo, and the 
ratio of the gas ejection rate to the star formation rate is in- 
versely proportional to the velocity dispersion of the galactic 
halo. 

We also carry out a simulation with the same cosmolog- 
ical and numerical parameters as the SPHw simulation, but 
without momentum driven winds, SPHnw (SPH No Winds). 
Although no energy is kinetically imparted to SPH particles, 
the SPHnw simulation includes thermal feedback from su- 
pernovae (see Springcl & Hcrnquist 2003). However, since 
the surrounding gas is dense, the energy is radiated away 
before it can drive outflows or significantly suppress star 
formation. Our SPHnw and SPHw si mulations are analo- 
:ous t o the nw and vzw simulations of lOppenheimer et al.l 
20ld ). who investigate the growth of galaxies by accretion 
and wind recycling and compare predicted mass functions 
to obs ervations. Our simulations are also used bv IZu et al.l 
who investigate intergalactic dust extinction. 
In addition to the two SPH simulations, we carry out 
a non-dissipative, purely gravitational, N-body simulation 
with identical cosmological and numerical parameters and 
the same initial positions and velocities of particles as the 
SPH simulations, except that the dark matter particle mass 
is higher by a factor of Qrn/{^m — J^t) to compensate for 
the gravitational effects of not including baryons. We refer 
to this N-body simulation as the PM (dark matter only) 
simulation. 



2.2 Identification of Groups and Substructures 

We identify dark matter haloes us ing a FOF (friends-of- 
friends) algorithm (|Pavis et al.ll 19851 ). The algorithm selects 
groups of particles in which each particle has at least one 
neighbour within a linking length. We assign a mass to the 
halos using a SO (spherical overdensity) criterion, with the 
threshold density set to the viri al overdensity in spherical 
collapse (|Kitavama fc Sutolll996l ). At 2 = 0, the mean in- 
terior overdensity with respect to the critical density of the 
Universe is 94. We set the centre of the group at the most 
bound FOF particle and go out in radius until the mean 
density enclosed is equal to the virial density. While FOF 
occasionally links multiple distinct concentrations together, 
most FOF halos c ontain a distinct m ass concentration (see 
Figures 1 and 2 of ISimha et"ai] l|2009l )). 

To iden tify substructure s within halos, we use the Adap- 
taHOP (Aubert et all |2004|) code. The details of the al- 
gorithm can be found in lAubert et al] (|2004l ). so we pro- 
vide only a brief summary of it here. We calculate den- 
sities around each dark matter particle using an SPH-like 
kernel estimator, with a cubic spline kernel containing 32 
neighbours. We then partition the ensemble of particles into 



"peak patches" where a peak patch is a set of particles with 
the same local density maximum. The connectivity between 
peak patches is dictated by the saddle points in the density 
field. We identify subsets of particles in each peak patch with 
SPH density larger than the density of the highest saddle 
point connecting it to a neighbouring peak patch. To select 
statistically significant substructures as opposed to random 
fluctuations in the density field, we require that the mean 
density of a substructure be 4a above the SPH density of 
the saddle point. The performance of Adapt aHOP on a sim- 
ulation similar to that here is illustrated in I Weinberg et all 
(2008) (see their Figure 1). 

Hydrodynamic cosmological simulations that incorpo- 
rate cooling and star formation produce dense groups of 
baryons with sizes and masses comparable to the luminou s 
regions of observed galaxies (Katz 1992; E vrard et al.lll9S)3 ). 
We identify galaxies using the Splin e Kernel Interpolative 
PENMAX (SKIeEI) algorithrn (iGelb fc Bertschingeilll994l : 
iKatz. Weinberg fc Hernquisti 1 19961 ). which identifies gravi- 
tationally bound particles associated with a common den- 
sity maximum. We refer to the groups of stars and cold gas 
thus identified as galaxies. The simulated galaxy popula- 
tion becomes substantially inco mplete below our re solution 
threshold of ~64 SPH particles (|Murah et al.ll2002l ). which 
corresponds to a baryonic mass of 5.8 xlO'^ Mq. Although 
this threshold applies to the total baryonic mass (stars plus 
cold, dense gas) of galaxies, we adopt it as our threshold for 
stellar mass and ignore galaxies with lower stellar mass. 

Figure [1] compares the galaxy stellar mass function in 
the SPHnw and SPHw simulations at z = 0. In the SPHnw 
simulation, there are 7,952 galaxies above our resolution 
threshold, corresponding to a space density of 0.064 
Mpc~^. In the SPHw simulation there are only 2,264 gala:x- 
ies above our resolution threshold, corresponding to a space 
density of 0.018 h'^ Mpc~'^ because wind feedback pushes the 
stellar mass of many galaxies below the 64 mspH threshold. 
The two mass functions gradually converge towards higher 
masses, joining at Ms > 10^^ * M©, because wind feedback 
has less suppressing effect in larger systems since the amount 
of material ejected in this model scales inversely with circu- 
lar velocity. lOppenheimer et all (|2010l ) discuss the compar- 
ison between the predicted stellar mass functions and ob- 
servational estimates in some detail. Roughly speaking, the 
SPHw model reproduces observational estimates for Ms < 
10^^ M0 , but it predicts excessive galaxy masses (at a given 
space density) for Mg > lO"'* Mq. 



2.3 Subhalo Abundance Matching 

Subhalo abundance matching (SHAM) is a technique for as- 
signing galaxies to simulated dark matter halos and subha- 
los. The essential assumptions are that all galaxies reside in 
identifiable dark matter substructures and that luminosity 
or stellar mass of a galaxy is monotonically related to the 
potential well depth of its host halo or subhalo. Some imple- 
mentations use the maximum of the circular velocity profile 
as the indicator of potential well depth, while others use 
halo or subhalo mass. The first cle ar formulations of SH AM 
as a systematic method appear in IConrov et al.l (|2006l ) and 
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IVale fc Qstrike j | 20061 ) , but these build on a number of pre- 
vious studies that either test the underpinnings of SHAM 
or impUcitly ass u me SHAM-Uke galaxy assignment (e.g. 
Colfn et al.ril999l : iKraytsov erahl l2004l : iNagai fc Kravtsovl 
2005^ ■ 

N-body simulations produce subhalos that are located 
within the virial radius of SO halos. The present mass of 
subhalos is a product of mass build up during the period 
when the halo evolves in isolation and tidal mass loss af- 
ter it enters the viri a l radius of a more mass ive halo (e.g. 
iKravtsov et al.1 12004| : iKazantzidis et al.lliooj '). The stellar 
component, however, is at the bottom of the potential well 
and more tightly bound making it less likely to be affected 
by tidal forces. Therefore, se veral authors (e.g. IConrov et al.l 
l2006l : IVale fc QstrikedlioO^ ) argue that the properties of the 
stellar component should be more strongly correlated with 
the subhalo mass at the epoch of accretion rather than at 
z = Q. 

IVale fc Ostrikerl l|2006l ) apply a global statistical cor- 
rection to subhalo m asses rel ative to halo mas ses (as do 
[Weinberg et al.l 120081 '). while ,Conrov et al.l (|2006t ) explicitly 
identify subhalos at the epoch of accretion and use the maxi- 
mum circular vel ocity at that epoch. O ur formulation here is 
similar to that of IConrov et al.l l|2006l ). though we use mass 
rather than circular velocity. Specifically, we assume a mono- 
tonic relationship between stellar mass and halo mass and 
determine the form of this relation by solving the implicit 
equation 

ns(> Ms) = nH{> Mh), (1) 

where ns and nn are the number densities of galaxies and 
halos, respectively, Ms is the galaxy stellar mass threshold, 
and Mh is the halo mass threshold chosen so that the num- 
ber density of halos above it is equal to the number density 
of galaxies in the sample. The quantity A/h is defined as 
follows: 

_ |A4aio(2 = 0) for distinct halos, 
\Mhaio(2 = 2sat) forsubhalos, 

where Zsat is the epoch when a halo first enters the virial 
radius of a more massive halo. We also consider (in §3.2) 
variants of this procedure in which we substitute R-band 
luminosity or instantaneous star formation rate for stellar 
mass in eq.(l). 

In §3, we use our SPH simulations to test the degree 
to which galaxy stellar mass and luminosity are monotonic 
functions of halo/subhalo mass, as assumed in SHAM. For 
independent halos, we set Mh equal to the 2 = mass of 
SO halos. We use the AdaptaHOP code to identify subhalos 
hosting satellite galaxies above the resolution threshold at 
z = in our SPH simulations. We then use their particle 
membership to identify their progenitor SO halos at Zsat, 
whose mass we adopt as Mh- We can identify a host sub- 
halo for the vast majority of our galaxies at z = 0. However, 
for about one percent of galaxies we cannot identify any as- 
sociated substructure. In these cases, we use the galaxy's 
baryonic particles to trace its high redshift progenitors up 
to the epoch when it was the only galaxy in an SO halo and 
adopt the mass of this SO halo as Mh- After matching, we 
examine the correlations of galaxy properties with Mh for 
both central and satellite systems, and we compare the stel- 
lar masses, luminosities, and (at high redshift) star forma- 



tion rates that would be assigned by monotonic matching to 
the simulation values. Of course, SHAM is usually applied to 
coUisionless N-body simulations, not SPH simulations, and 
the subhalo populations can differ even for the same initial 
conditions because of the dynamical effects of the dissipative 
baryons on the dark matter. 

In §4, we populate halos/subhalos in our DM-only sim- 
ulation using SHAM, following a procedure akin to the one 
usually used to match simulated halos/subhalos to observed 
galaxies. For independent halos, we set Mh equal to the 
2 = mass of SO halos. We identify subhalos using the 
AdaptaHOP code and use their particle membership to iden- 
tify their progenitor SO halos at Zsat, whose mass we adopt 
as Mh- We have checked that at z^ax , the relation between 
^^max and Mh is close to monotonic, so we would get similar 
results from using Umax instead of Mh- For comparison, we 
also implement a procedure that we refer to as SHAMzO, 
where we set Mh equal to the z = mass for both indepen- 
dent halos and subhalos identified in our N-body simulation. 

iKnebe et al.l (|201ll ) find differences in the performance 
of various halo finding algorithms in identifying substruc- 
tures and accurately recovering their properties, particularly 
for subhalos containing fewer than 40 particles. Since we 
trace the Zsat progenitors of 2 = substructures, our results 
are unlikely to be affected by random fluctuations in the 
density field of halos that may be spuriously identified as 
substructures. However, if subhalos hosting satellite galax- 
ies that have merged with the central galaxy of the halo are 
identified as substructures, we would overestimate the halo 
occupation of massive halos. Conversely, subhalos that fall 
into more massive halos and lose a substantial fraction of 
their mass due to tidal stripping may no longer be resolved 
in the simulation at 2 = even though they might survive 
and host satellite galaxies in a sufficiently high resolution 
calculation. 



3 SHAM IN THE SPH SIMULATIONS 
3.1 Stellar Mass 

The left column of Figure[2]shows galaxy stellar mass plotted 
against halo mass in the SPHnw simulation (top) and the 
SPHw simulation (bottom) . For galaxies that are the central 
objects of their parent halos (henceforth central galaxies), 
the halo mass, Mh, on the horizontal axis, is the mass of 
the SO halo in which the galaxy is located at 2 = 0. For 
galaxies that are not the central objects of their respective 
SO halos (henceforth satellite galaxies), Mh is the mass of 
the SO halo in which the galaxy is located at 2sat, the last 
output epoch before its parent halo fell into the virial radius 
of a more massive halo; 2aat values range between 2 = 3 and 
2 = 0.05. The black solid curve and red dotted curve show 
the median galaxy stellar mass in evenly spaced logarith- 
mic bins of halo mass for all galaxies and satellite galaxies 
respectively, while the red open circles and black open rect- 
angles show individual galaxies that are within the top five 
percent and bottom five percent by stellar mass in each halo 
mass bin. 

The key result of Figure [5] is that the ratio of galaxy 
mass to Mh is similar for central galaxies and satellite galax- 
ies. Between 2aat and 2 = 0, satellite galaxies have lower 
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gro wth rates compared to central galaxi es of similar mass 
(see lKeres et al.ll2009l : ISimha et all l2009'). However, during 
the same time period, while central galaxies grow at a faster 
rate, their host halos also accrete mass and "receive" merg- 
ers of lower mass halos. The balance between stellar mass 
growth and halo mass growth leads to a similar Ms-Mh 
relation at 2; = for central and satellite systems, in both 
simulations. 

Note that in Figure [2] as well as in the remainder of this 
section, we only consider galaxies that are above our reso- 
lution threshold at « = 0, with Ms > 64mspH = 5.8x10^ 
Mq. However, some satellite galaxies in dense environments 
experience mass loss between z^at and z = 0, and, conse- 
quently, a fraction of satellites that are above the resolution 
threshold at Zsat are pushed below it by 2: = 0. We defer 
examination of this point to §4. 

We assign galaxies to the halo/subhalo population via 
the monotonic mapping procedure described in §2. The 
right-hand side column of Figure [2] shows the distribution of 
the ratio of the stellar mass assigned to a halo to the stellar 
mass of the SPH galaxy that is actually located within it for 
the SPHnw simulation (top) and the SPHw simulation (bot- 
tom). In addition to the distribution for all galaxies (black 
solid curve), we show the distribution for satellite galaxies 
alone (red dotted curve). For most halos, the SHAM as- 
signed mass is close to the mass of the SPH galaxy located 
within it in both the SPHnw and the SPHw simulations, 
although there are a small number of extreme outliers. We 
characterise the width of the distributions by ctm such that 
68% of galaxies have |_R| = |log Ma/Mr\ < cjm where Ma 
is the assigned mass and Mr is the SPH (real) mass. In 
the SPHnw simulation, the width of the distribution is 
= 0.09, while in the SPHw simulation cta/ — 0.11. In both 
simulations, the distribution of SHAM assigned masses for 
satellite galaxies is also centered on the SPH mass, but there 
is greater scatter than in the case of central galaxies. For 
satellite galaxies in the SPHnw simulation, the distribution 
of assigned mass is skewed such that the assigned mass is 
likely to be slightly higher than the SPH mass. 

Figure [3] presents the same distribution of |log Ma/MrI 
at z = 0.5, 1 and 2, and compared to the z = result from 
Figured) There is good agreement between the SHAM as- 
signed stellar masses and the SPH stellar masses at all four 
epochs. The fraction of satellite galaxies decreases with in- 
creasing redshift. Despite this, au shows a continuous trend 
of increasing with redshift, from 0.09 at z = to 0.13 at 
2: = 2 in the SPHnw simulation, and from 0.11 at z = to 
0.14 at 2; = 2 in the SPHw simulation. Although the total 
baryonic mass (not shown) is equally well correlated with 
halo mass at z — 2 and at z = 0, the higher mean gas frac- 
tion and the larger halo-to-halo scatter in gas fraction at 
high redshift leads to higher scatter in the halo mass-stellar 
mass relation. 

While the relationship between halo mass and galaxy 
stellar mass is roughly monotonic in our SPH simulations, 
there is some scatter, with the strongest outliers arising from 
satellite galaxies. We examine the sources of this scatter in 
Figure[4l both to understand the physical processes that give 
rise to it and to explore the possibility of adding a parameter 
that would sharpen the subhalo abundance matching. We 
restrict ourselves to 2 = as the satellite galax;y sample 
is largest at this epoch. Each panel plots satellite galaxy 



stellar mass as a function of A4h at Zsat with SPHnw in the 
top panels and SPHw in the bottom. 

In panel (a), points are colour coded by the mass of the 
2 = halo (not subhalo) that hosts the satellite, and lines 
show the median relation between Ms{z = 0) and Mnizaat ) 
in four bins of 2 = halo mass. The median curves are 
nearly identical for the four bins, indicating that the typical 
Ms / Mh is at most minimally correlated with the final halo 
mass. The outlier points at low Ms/ Mh are mostly galaxies 
that have experienced stellar mass loss. These outliers are 
found in all Mhaio bins; one should not read too much into 
the relative numbers of outlier points as the total number of 
satellites varies from bin to bin. We caution that this figure 
does not show galaxies that have lost enough mass to fall 
below the Ms ~ 64 mspH threshold by 2 = (see Figure 
11, below). 

In panel (b), points and lines are coded by satellite ac- 
cretion epoch. Once again, there is little difference in the 
median relations and outliers are found in all the Zsat bins 
(except 2sat > 2). Panel (c) divides galaxies into those that 
have lost stellar mass since 2sat, those that have increased 
their stellar mass by less than 10% since Zaat and those that 
have increased their stellar mass by more than 10%. Not sur- 
prisingly, galaxies that have lost stellar mass have system- 
atically lower Ms/Mh at 2 = 0. Median relations for the 
other two populations are similar. There are some galaxies 
that are low Ms/Mh outliers despite having gained mass 
since Zaat , indicating that at least some of this outlier pop- 
ulation comes from satellites that had anomalously low Ms 
at 2sat. Panels (d) - (f) repeat this analysis for the SPHw 
simulation. The results are qualitiatively similar, though the 
smaller number of galaxies in this simulation makes it diffi- 
cult to draw firm conclusions. 

3.2 Luminosity and Star Formation Rate 

So far, we have used SHAM to assign stellar masses to ha- 
los and compare them to the stellar masses of SPH galaxies 
located in those halos. However, when SHAM is applied to 
an observed galaxy population, it is often used to assign 
luminosities to simulated halos to compare the luminosity 
dependence of the clustering properties of the simulated ha- 
los to that of the observed galaxies. 

The luminosity of a galaxy is correlated with its stellar 
mass, but is not reducible to a simple function of stellar 
mass because the stars formed at different times. We track 
the star formation histories of simulated galaxies in our SPH 
simulatio ns, and use th e stellar population synthesis package 
of Co nrov et al.l l|2009l ) to compute their luminosities. We 
assume that stars are formed with a Chabrier initial mass 
function. We assume solar metallicity and do not consider 
dust extinction. Changing the initial mass function in the 
same way for all galaxies would alter the zero-point of the 
luminosity-stellar mass relation but would not be likely to 
add scatter, while including dust extinction would shift the 
mean relation and increase the scatter somewhat. 

The left column of Figure[S]shows the r-band luminosity 
of SPH galaxies against halo mass in the SPHnw simulation 
(top) and the SPHw simulation (bottom). For galaxies that 
are the central objects of their parent halos (henceforth cen- 
tral galaxies), the halo mass, Mh, on the horizontal axis, 
is the mass of the SO halo in which the galaxy is located 
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at 2 = 0. For satellite galaxies, it is the mass of the SO 
halo in which the galaxy is located at z^ax , the last out- 
put epoch before its parent halo fell into the virial radius 
of a more massive halo. The points show individual galaxies 
that are either in the top 5% or bottom 5% by luminosity in 
each halo mass bin. Most of the low luminosity outliers are 
satellite galaxies. This is primarily because satellite galaxies 
typically have an older stellar population than central galax- 
ies of similar stellar mass and are, therefore, less luminous. 
A secondary factor, of less importance, is the difference be- 
tween central and satellite galaxies in the stellar mass-halo 
mass relation shown in Figure [5] 

To assign luminosities to our halos, we rank order ha- 
los by mass and assign simulated galaxies rank ordered by 
r-band luminosity to them, as done previously for stellar 
mass. The right column of Figure [5] shows the distribution 
of the ratio of the SHAM assigned luminosity of a halo to the 
luminosity of the SPH galaxy within it in the SPHnw simu- 
lation (top right) and the SPHw simulation (bottom right). 
In both simulations, the distribution of SHAM assigned lu- 
minosities is centered on the SPH luminosity. In analogy 
with the previous subsection, we define ul such that 68% of 
galaxies have \R\ = |log La/Lh| < ah where La is the as- 
signed luminosity and Lr is the SPH (real) luminosity. In the 
SPHnw as well as the SPHw simulation, itl = 0.15, which 
is greater than the corresponding ctm (see Figure [2)). For 
satellite gala:xies, the assigned luminosity is systematically 
higher than the SPH luminosity because of the stellar pop- 
ulation differences, the offset being 0.08 dex in the SPHnw 
simulation but only 0.02 dex in the SPHw simulation. 

The clustering properties of 2 > 2 galaxies are stud- 
ied observationally using the Lyman break technique, in 
which high redshift star forming galaxies are identified by 
optical photo metry alone using t heir redshi ft ed rest frame 
UV ra diation (|Steidel et al.lll996l . ll999l . l2003l ). IConrov et all 
l|2006l ) find good agreement between the clustering of Ly- 
man break galaxies (LBGs) and the clustering of halos in 
their N-body simulation when they use SHAM to match 
halos to galaxies. The relationship between LBGs and 
their host halos is important in understanding the prop- 
erties of LBGs, in partic ular whether they are a quiescent 
star-forming population ( Coles et al.l 1 19981 : IMo et al] 1 19991 : 
iGiavalisco fc Dickinson 2001) or a merg er driven starbust 
popu lation ((Somervil le ct al. 2001,; Scannapieco fc ThackeJ 
I2OO3I ). These applications raise the question of whether 
SHAM can be reliably applied to the rest-frame UV lumi- 
nosities of high redshift galaxies, which depend mainly on 
their star formation rates (SFRs) rather than their stellar 
masses. 

In our simulations, the SFRs and stellar masses of high 
redshift ga l axies are well correlated (see, e.g., figure 7 of 
iDave et all I2OIOI '). In the left panels of Figure [1 we di- 
rectly investigate the relationship between the z — 2 SFRs 
of our simulated galaxies and the properties of their par- 
ent halos, in the same format used previously for stel- 
lar mass and r-band luminosity a.t z — 0. To calculate 
the SFR of an SPH galaxy, we sum over the SFRs of 
all its gas particles compute d acco rding to the formula in 
iKatz. Weinberg fc Hernguistl (|l996l ). In contrast to z = 0, 
where there is a significant passive, non star forming popu- 
lation, only ~ 0.1 % of galaxies have no star formation at 
z = 2, and these are excluded from the analysis. In the ab- 



sence of dust extinction, the instantaneous SFR should be 
a good indicator of rest-frame UV luminosity, since the lat- 
ter is dominated by the output of young, short-lived stars. 
We caution, however, that dust extinction corrections for 
LBG UV luminos ities are typically factors of several (e.g. 
ISteidel et al]|2003l ). and a scatter in extinction at fixed SFR 
could add significant scatter to the relation between Mh and 
UV luminosity. Since colours provide an indication of extinc- 
tion, the most effective strategy for observational analysis is 
probably to apply colour based extinction corrections before 
subhalo abundance matching, so that only the errors in the 
corrections add scatter. 

Figure [5] shows that the relation between intrinsic SFR 
and Mh at z = 2 is nearly as tight as the relation between R- 
band luminosity and Mh at 2: = 0. We can therefore assign 
SFRs to SHAM galaxies via the same monotonic match- 
ing used previously for stellar mass and R-band luminosity, 
with the results shown in the right panels. The 68% scat- 
ter of |log SFRa/SFRhI is as = 0.12 and as = 0.16 for 
the SPHnw and SPHw simulations respectively, compared 
to the _R-band luminosity scatters of aL ~ 0.15 for both 
simulations at 2: = 0. The median relations for central and 
satellite galaxies are nearly the same, in both cases, with off- 
sets that are small compared to the intrinsic scatter. How- 
ever, in the no-wind simulation the outliers at low SFR are 
preferentially satellites, a result of the gradual shutoff of 
gas accretion after galaxies become sat ellites in larger ha- 
los l|Keres et al.ll2009l : ISimha et al.ll2009l ). The SFRa/SFRjj 
histogram is, therefore, skewed towards overestimated SFRs 
for satellites, though this remains a small effect. For the wind 
simulation, there are many fewer galaxies above our 64 mspH 
stellar mass threshold, and star formation rates at Mh < 
IO^'^ '^Mq are suppressed by the momentum driven outfiows. 
In this simulation, there are more satellite outliers at high 
SFR, and the (noisy) histogram of SFRa/SFR^ for satel- 
lite galaxies shows an overall shift toward underestimated 
SFR. For central galaxies, gas fractions in the SPHw sim- 
ulation are systematically higher than those in the SPHnw 
simulation, and scatter in these central galaxy gas fractions 
(contributing scatter in SFR) is the largest factor driving 
higher as for the SPHw case. 

Figure [S] suggests that SHAM should be a reliable tool 
for assigning SFRs to simulated halo populations at high 
redshift (given an observed SFR distribution), or for infer- 
ring the halo masses associated with observed Lyman-break 
galaxies. We caution that our simulations do not produce a 
substantial population of passive galaxies at these redshifts, 
either central or satellite, and that physical mechanisms that 
strongly suppress star formation at high stellar mass could 
reduce the accuracy of SHAM assignment. 



4 SHAM IN THE DM SIMULATION 

So far we have applied subhalo abundance matching to 
the halo and subhalo hosts of galaxies in the SPH simu- 
lations, where the condensed baryons may improve the sur- 
vival of dark matter substructures. We now turn to SHAM 
as it is traditionally applied by using the DM simulation, 
which starts from the same initial conditions but includes no 
baryons. Independent SO halos above the resolution limit 
have similar locations and masses in our SPHnw, SPHw 
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and DM simulations. We identify dark matter substructures 
within these SO halos using the AdaptaHOP code as de- 
scribed earlier. While there is reasonable agreement in the 
number and abundance of subhalos between the two SPH 
simulations and the DM simulation, there are positional dif- 
ferences. For each subhalo above our resolution threshold of 
64 particles, we trace its high redshift progenitors up to the 
epoch when it was an independent halo, i.e., up to Zgat . 

The solid curve in each panel of Figure [7| shows (N(M)), 
the mean number of SPH galaxies per halo above a given 
stellar mass threshold in each halo mass bin in the SPHnw 
simulation. The four panels correspond to different stellar 
mass thresholds, and the mean space density of galaxies 
above these thresholds ranges from 0.004 to 0.06 h^Mpc'^. 
The dashed curve shows the mean number of galaxies per 
halo when halos in the DM simulation are populated with 
galaxies using SHAM as described in §3, with a mono- 
tonic relation between stellar mass and Mh (equation 2). 
For comparison, the dotted curves show results of a model 
(SHAMzO) in which we assume a monotonic relationship 
between galaxy stellar mass and z — subhalo mass, 
ra ther than Zsat s u bhalo mas s. Note that th e proc edures 
of I Vale fc Ostrikeil (|2004l ) and [Weinberg et al.l (|2008l ) differ 
from SHAMzO because they apply a global mass-loss correc- 
tion to subhalo masses, though they do not consider post- 
Zss,t mass loss on an object-by-object basis. 

In low mass halos, where the satellite fraction is low, 
both the SHAMzO and SHAM model predictions for the 
average number of galaxies per halo are in good agree- 
ment with the SPH simulation. At high halo mass, however, 
SHAMzO underpredicts the number of galaxies per halo be- 
cause it does not account for subhalo mass loss. The stellar 
masses assigned to subhalos are too low, and galaxies that 
should be above a stellar mass threshold instead fall below 
it. SHAM, on the other hand, remains in good agreement 
with the SPH simulation over a wide range of halo masses 
and galaxy stellar mass thresholds. 

Figure[8]is the analogue of Figure[7l but using the SPHw 
simulation instead of the SPHnw simulation. As winds re- 
duce stellar mass, the number density of galaxies above a 
given stellar mass threshold is lower than in the SPHnw sim- 
ulation. Nonetheless, the trends for the SPHnw simulation 
discussed above still hold. Abundance matching using the 
z = mass of subhalos underpredicts the number of galax- 
ies in massive halos. Although, SHAM (using the Zsat mass 
of subhalos) provides a better fit to the SPH galaxy sample, 
it overpredicts the number of galaxies in massive halos to a 
larger degree than in the SPHnw simulation. The one-halo 
term of the galaxy correlation function depends on the mean 
pair number (N( N — 1)). We have checked that the stan - 
dard prescription l|Kravtsov et al.ll2003 : IZheng et al.ll2005l l 
that satellite galaxies follow Poisson statistics {{Nsat {Nsa.t - 
l)) = (A'sat )^ describes both simulations well, allowing com- 
putation of {iV(7V - 1)) from {N{M)). 

Figure [9] compares the halo occupations of SPH galax- 
ies to the SHAM and SHAMzO populations in each of the 
30 most massive halos at z = 0, for galaxies above the 64 
mspH ~ 5.8x1O'M0 threshold. As already seen in Figures 
[7] and [SI SHAMzO predicts too few galaxies in massive halos 
in both simulations. In the SPHnw simulation, the agree- 
ment between the number of SPH galaxies in each halo and 
the number of galaxies assigned to the halo by SHAM is 



remarkably good, indicating that SHAM is not just repro- 
ducing the typical number of galaxies for a given halo mass, 
but is also capturing the variation in galaxy number at a 
given halo mass. In the SPHw simulation, however, SHAM 
more noticeably overpredicts the number of galaxies in the 
most massive halos, and it does not track the variation in 
galaxy number at a given halo mass as well as in the SPHnw 
simulation. 

Galaxy clustering depends on halo occupation statis- 
tics like those shown in Figures 7-9, and on small scales it 
also depends on the radial profile of satellites in massive ha- 
los. Figure [TO] compares the radial number density profile of 
SPH galaxies around the central galaxy of the halo to the 
radial number density profile of galaxies assigned to halos 
by SHAM and SHAMzO. The left panels correspond to the 
single most massive halo in the simulations, with Mhaio ~ 
4xlO^'*M0, while the right panels correspond to the three 
halos with 10"Mq < Mhaio < 3x10^*Mq. In aU cases, the 
slopes of the radial density profiles are similar for SPH galax- 
ies and for subhalos populated by SHAM and by SHAMzO, 
but the normalisations and the inner truncations (indicated 
by where the curves stop) differ. The normalisation differ- 
ences correspond to the differences in halo occupation at 
high Mhaio seen in Figures [S] and [5] SHAMzO predictions 
are always suppressed relative to SPH galaxies because they 
neglect subhalo mass loss. Tidal stripping is more severe for 
subhalos near the halo centre, exacerbating this effect and 
causing truncation of the SHAMzO profiles at lar ger radii 
compared to SPH galaxies. [Weinberg et al.l ([2008[ ) found a 
similar truncation effect even when including a global cor- 
rection for subhalo mass loss. 

In contrast to SHAMzO, SHAM overpredicts the num- 
ber of galaxies in high mass halos, by a small factor in the 
SPHnw simulation, and by 0.1-0.3 dex in the SPHw sim- 
ulation. This overprediction is a consequence of neglecting 
stellar mass loss that occurs after Zsat in the SPH simulation, 
which is not accounted for in the SHAM recipe. The stellar 
mass loss is more severe for satellites close to the halo cen- 
tre, so in this case the SPH profiles truncate at larger radii 
than the SHAM profiles. 

The overprediction of satellite numbers by SHAM may 
seem surprising in light of the good agreement between as- 
signed and true stellar masses seen in Figure 2, with distri- 
butions that peak at log Ma/ Mr = and have only mild 
asymmetry. However, the relations in Figure 2 (and the ver- 
sions divided by halo mass in Figure 4) only include galaxies 
that remain above the 64 mspH stellar mass threshold at 
z = 0. In high mass halos, a significant fraction of satellites 
have dropped below the mass threshold by z = in the wind 
simulation, but SHAM would place all of these galaxies on 
the Ms-Mh relation traced by the bulk of the satellites. In 
the no-wind simulation, the fraction of satellites that suffer 
such severe mass loss is smaller. 

We suspect that galaxies in the wind simulation are 
more fragile because of their lower baryonic masses, and 
because they are typically less concentrated than galaxies 
formed without winds; in both respects, they more closely 
resemble observed galaxies. However, satellites in the SPHw 
simulation that are disrupted are close to the resolution 
threshold, and we cannot rule out the possibility that their 
disruption is a numerical artifact. For comparison, in Fig- 
ure [To] we also show the radial number density profile of 
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SPH galaxies and SHAM selected subhalos around the cen- 
tral galaxy of the halo only for galaxies above a higher 
mass threshold of T.SxlO^M© in the no winds simulation 
and 2.8xlO^M0 in the winds simulation, corresponding to 
a number density of 0.007 Mpc~^. With the higher stellar 
mass threshold, the differences in the normalisation and in- 
ner truncation of the SPH and SHAM curves is substantially 
reduced. Comparisons of our results with future higher res- 
olution simulations of a similar volume will help clarify the 
extent to which our finite mass resolution affects our results. 

Panels (a) and (b) of Figure [TT] show the two-point cor- 
relation function of SPH galaxies (solid curve) and of ha- 
los and subhalos selected by SHAM (dashed curve) and by 
SHAMzO (dot-dashed curve), in the SPHnw simulation and 
the SPHw simulation respectively. All gala:xies above our 
adopted stellar mass threshold of 64 mspH = S.SxlO^M© 
are included which gives a space density of gala^cies of 0.064 
Mpc"^ in the SPHnw simulation and 0.018 /i^ Mpc"^ in 
the SPHw simulation. SHAMzO underpredicts the correla- 
tion function in both simulations, more severely for SPHw, 
as expected from its underpopulation of massive halos (Fig- 
ures 7-10). At scales r > 2h^^Mpc, the "two-halo" regime 
of the correlation function, the depression of ^(r) reflects a 
drop in the galaxy bias factor from underweighting these 
highly biased halos. At scales r < l/i~^Mpc, where galaxy 
pairs within the same high mass halo make a large contri- 
bution, the suppression of ^(r) is more severe. 

For the SPHnw simulation, agreement between the 
SHAM and SPH-galaxy correlation functions is excellent, 
as one would expect from the close agreement of halo occu- 
pations and radial profiles seen in earlier figures. The largest 
discrepancies are ~10% for both 2/i~^Mpc < r < 10/i~^Mpc 
and r < 2/i~^Mpc. For SPHw, the agreement in the 2-halo 
regime is still very good, with a maximum discrepancy of 
--15% for 2/i"^Mpc < r < 10/i^^Mpc. However, the over- 
population of high mass halos leads to substantial overpre- 
diction of ^(r) in the 1-halo regime, by up to a factor of 
~2.5. This discrepancy arises from the severe stellar mass 
loss that affects a small but not negligible fraction of satel- 
lites in high mass halos and is not captured by the SHAM 
recipe. The impact on three-point or higher order correla- 
tion functions, which more strongly weight the single-halo 
occupations at small scales, would be more severe. 

Panels (c) and (d) of Figure[TT]are the analogues of pan- 
els (a) and (b) respectively, but only show galaxies whose 
parent halos are more massive than lO^^M© (at z = for 
central galaxies and isat for satellite galaxies). This corre- 
sponds to a space density of galaxies of 0.007 Mpc"^ 
and a stellar mass threshold of 7.3x lO^^M© in the no winds 
simulation and 2.8xlO^''M0 in the winds simulation. For 
the SPHnw simulation, the trends discussed above still hold, 
agreement between the SHAM and SPH-galaxy correlation 
functions is better than ~18% on all scales. For the SPHw 
simulation, in contrast with panel (b), the agreement be- 
tween the SHAM predicted and SPH galaxy correlation 
functions is better than ~20% on all scales except r < 
0.3/i~^Mpc, where the discrepancy rises to ~50%. Com- 
pared to galaxies hosted by low mass subhalos, a substan- 
tially smaller fraction of galaxies hosted by more massive 
subhalos undergo severe stellar mass loss. Furthermore, for 
subhalos with Mh (at Zsat ) > IO^^Mq, the fraction of satel- 



lites that survive up to 2 = is only marginally lower in the 
SPHw simulation compared to the SPHnw simulation. 



5 CONCLUSIONS 

Ever since the i dentification o f subs t ructures in N-body 
simulations (e.g. Ghigna et al. [l998l: iKlvpin et al.l 1 19991 : 



iMoore et alll 19991 : 



Springel et al.|[200lr ). t here have been ef- 



forts to associate them with galaxies (e.g. 'Coli'n et al.lll99S^ : 
[kravtsov ct al. 2004; Weinberg et al. 2008). Subhalo abun- 
dance matching (SHAM) has had impressive empirical suc- 
cess, reproducing observed galaxy clustering over a wide 
range of luminosity and redshift and correctly diagnosing 
(via cluster mass-to-light ratios) the overestimated matter 
clustering amplitude ( as ~ 0.9 vs. as ~ 0.8) in WMAPl-era 



cosrnological models jConrov et al. _ 20061 : IVale Sz Ostrikej 
l2006l : lGuo et al.ll2010l : rMoster et aLlbOld ). Our investigation 
provides the first full-scale test of SHAM against hydrody- 
namic cosmological simulations, where the correct identifi- 
cation between galaxies and subhalos is known a priori, in- 
cluding both a simulation with minimal feedback (SPHnw) 
and a simulation with momentum-driven winds that bet- 
ter reproduces the observ ed galaxy stellar mass function 
dOppenheimer et al.ll2010l ). This investigation yields phys- 
ical insight into the galaxy formation process in these sim- 
ulations, and it demonstrates the strengths and potential 
limitations of SHAM as a tool for interpreting observed 
galaxy clustering, significantly extending earlier studies by 
iNagai fc Kravtsov! (j2005l) and I Weinberg et al] |200^). 

When we consider galaxies above our adopted stellar 
mass threshold at z = 0, Ms > 64 mspH = 5.8x10® M©, we 
find a tight correlation between galaxy stellar mass and the 
mass of the parent halo or subhalo. For central galaxies, we 
consider the full mass of the spherical overdensity at z = 0, 
while for satellite galaxies we use the mass of the parent halo 
just before the epoch ^aat when it first becomes a satellite. 
Importantly, the median relation and scatter between Ms 
and Mh are similar for central galaxies and satellite galax- 
ies, in both simulations, and the outlier fraction for satellite 
galaxies is only modestly higher, mainly because of stellar 
mass loss in some satellites after Zsat . As a result, SHAM 
assignment of stellar masses is remarkably effective in both 
simulations. The 68% scatter in _R = log Ma/ Mr, where 
Ma is the assigned stellar mass and Mr the real (simula- 
tion) stellar mass, is aM = 0.09 dex for SPHnw and 0.11 dex 
for SPHw. The distribution of R is only mildly asymmetric, 
with a small extended tail of outliers. Similar results, with 
slightly increased scatter, hold a.t z — 0.5, 1 and 2. We find 
no clear correlation between residuals from the Ms-Mh re- 
lation and the satellite epoch at Zsat or the parent mass of 
the z = Q halo, so our tests do not suggest a way to further 
tighten SHAM by considering additional properties. 

Using _R-band luminosity in place of stellar mass yields 
similar results, but with a larger population of outliers 
among satellites caused by their systematically older stel- 
lar populations and thus higher Ms / L ratios. SHAM should, 
therefore, be applied to stellar masses (estimated from lumi- 
nosity and colour or spectral energy distribution) when pos- 
sible, or to luminosity in redder bands that more faithfully 
trace stellar mass. At z = 2, instantaneous star formation 
rates, which should be a good proxy for observed-frame op- 
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tical luminosities, are well correlated with Mh, with similar 
correlations for central and satellite galaxies. SHAM assign- 
ment of star formation rates at this redshift is quite effective 
with a scatter in log SFRa/SFR_r of 0.12 dex in SPHnw and 
0.16 dex in SPHw. This result reinforces empirical evidence, 
based on galaxy clustering data, that SHAM is an effective 
tool for modeling Lyman-break galaxies at high redshift. 

SHAM is traditionally applied to N-body rather than 
SPH simulations (or to analytic descriptions calibrated on 
N-body). To test this standard form of SHAM, we have ap- 
plied it to the AdaptaHOP subhalo population of a pure 
dark matter (DM) simulation started from the same ini- 
tial conditions as the SPH simulations. Because subhalo 
positions within pare nt halos shift between SPH and DM 
l| Weinberg at al.ll2008l ). we have focused our comparison on 
halo occupation statistics and radial profiles, which together 
determine many properties of observable galaxy clustering, 
and on the real space two-point correlation function. 

Using the galaxy stellar mass function of the SPHnw 
or SPHw simulation as input, SHAM (applied to the DM 
simulation) does quite well in reproducing the correspond- 
ing mean halo occupation, (A''(Mhaio)), and the slope of the 
galaxy radial profile in high mass halos. In SPHnw, but 
not SPHw, SHAM traces the individual halo-to-halo varia- 
tions in galaxy number at similar halo mass. Use of subhalo 
masses at ^aat rather than z = makes a critical differ- 
ence; if we use z = subhalo masses, the galaxy occupation 
in high mass halos is systematically depressed because sub- 
halos lose mass by tidal stripping after becoming satellites. 
Traditional SHAM (using Zsat masses), by contrast, tends to 
overpredict the galaxy numbers in high mass halos. We trace 
this discrepancy to the small but not negligible population 
of satellite galaxies that suffer severe stellar mass loss in the 
SPH simulations, so that they move from above our reso- 
lution threshold at Zsat to below it at 2 = 0. The subhalos 
themselves remain identifiable in the DM simulation, and 
SHAM populates them with galaxies that lie on the main 
Ms-Mh relation. The effect is more significant in the SPHw 
simulation, whose galaxies are apparently more vulnerable 
to severe mass loss because of their shallower baryonic po- 
tential wells. The mean occupation of high mass halos in 
SPHw is overpredicted by 0.1-0.3 dex, while in SPHnw the 
offset is less than 0.1 dex. For SPHnw, the SHAM-predicted 
correlation function agrees with that of SPH galaxies to bet- 
ter than 10% at all scales 0.05 h'^Mpc < r < 10 /i"^Mpc. 
For SPHw, SHAM predicts the correlation function to 15% 
at r > 2 /i^^Mpc but it overpredicts by a factor of ~2.5 at 
r < 0.5 fe~^Mpc because of the overpopulation of massive 
halos. However, if the stellar mass threshold is increased to 
2.7x10^° M0, then the SHAM predicted two-point corre- 
lation function agrees with that of SPH galaxies to better 
than 20% on all scales except r < 0.3 /i~^Mpc, where the 
discrepancy widens to 50%. This improvement is achieved 
because galaxies that suffer severe mass loss are typically 
low stellar mass galaxies in low mass subhalos {Mh < 10^ 
M0). While we would expect low mass galaxies in low mass 
subhalos to be more susceptible to severe mass loss, further 
investigation with higher resolution simulations is required 
to understand the extent to which this is a numerical arti- 
fact caused by proximity of the mass to the threshold above 
which we are reliably able to resolve galaxies. For galaxies 
with luminosity L>L*, our results suggest that SHAM will 



modestly (by ^ 10-30 %) overpredict the two-point correla- 
tion function on small scales and yield accurate clustering 
predictions on larger scales. 

In both of our SPH simulations, halo mass (at z = for 
central galaxies or Zaat for satellite galaxies) is the primary 
determinant of galaxy stellar mass, luminosity, and (at high 
redshift) star formation rate. The most significant secondary 
factor, and thus the most significant limitation on the ac- 
curacy of SHAM, is the small fraction of satellite galaxies 
that suffer severe stellar mass loss even though their host 
subhalos survive. The sensitivity to feedback, indicated by 
the difference between our SPHnw and SPHw simulations, 
motivates further investigation of galaxy mass loss in high 
mass halos for a wider range of feedback prescriptions. 

For galaxies with L > Lt, the SPHw simulation predicts 
excessive galaxy masses and excessive late-time star forma- 
tion, indicating the need for an additional physical mech- 
anism such as AGN feedback. The accuracy of SHAM for 
high luminosity galaxies will depend on how tightly corre- 
lated such feedback is with halo mass. Overall, however, the 
strong role of dark matter in governing galaxy formation 
makes subhalo abundance matching a powerful technique 
for making realistic artificial galaxy catalogues and for in- 
terpreting the observed distribution of galaxies. 
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log Mg/M^ 

Figure 1. Galaxy stellar mass functions at z = in the SPHw simulation (dashed), which incorporates momentum driven winds, and 
the SPHnw simulation (solid), which does not. In this and later plots, Mg refers to the stellar mass of SKID-identified galaxies. 
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Figure 2. (Left) Stellar mass at 2 = versus halo mass in the SPHnw simulation (top) and the SPHw simulation (bottom). Each 
point represents an SPH galaxy and we only include galaxies above the Mg = 64mgpH = 5.8 X 10^ Mq threshold at 2 = 0. For central 
galaxies, shown as black points, the halo mass is the 2 = mass of the host halo, while for satellite galaxies, shown as red points, the 
halo mass is the mass of the parent halo just before 2sat , the epoch at which it became a satellite. Only galaxies that are in the top 
5% or bottom 5% by stellar mass in each 0.25 decade wide halo mass bin (relative to all galaxies in the bin) are shown. The solid and 
dotted curves show the median stellar mass in each halo mass bin for all galaxies and satellite galaxies respectively. (Right) Probability 
distribution of the ratio of stellar mass assigned by subhalo abundance matching to SPH galaxy mass in the SPHnw simulation (top) 
and SPHw simulation (bottom), with the solid curve standing for all galaxies and the dotted curve for satellite galaxies. 
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Figure 3. Probability distribution of tiic ratio of stellar mass assigned by subhalo abundance matching to SPH galaxy mass in the 
SPHnw simulation (left) and SPHw simulation (right). The solid, dot-dashed, dashed and dotted curves represent redshifts, z = 0, 0.5, 
1 and 2 respectively. 
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Figure 4. Stellar mass of satellite galaxies versus parent halo mass just before z^iit- Each point represents a galaxy, and only galaxies 
that are in the top 5% or bottom 5% by stellar mass in each 0.25 decade wide halo mass bin (relative to the distribution of all galaxies) 
are shown. In panel (a), the circles, triangles, squares and crosses stand for different z = host halo mass bins while in panels (b) and 
(c), they stand for bins of Zsat, the epoch of accretion of the satellite and AM/M, the change in stellar mass since Zeat, respectively. 
Panels (d), (e) and (f) arc analogous to panels (a), (b) and (c) but using the SPHw simulation. 
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Figure 5. (Left) H-band luminosity at 2 = versus halo mass in the SPHnw simulation (top) and the SPHw simulation (bottom). Each 
point represents a galaxy in the SPHnw simulation. For central galaxies, shown as black points, the halo mass is the 2 = mass of the 
host halo, while for satellite galaxies, shown as red points, the halo mass is the mass of the parent halo just before z^at • Only galaxies 
that are in the top 5% or bottom 5% by ij-band luminosity in each 0.25 decade wide halo mass bin are shown. The solid and dotted 
curves show the median i?-band luminosity in each halo mass bin for all galaxies and satellite galaxies respectively. (Right) Probability 
distribution of the ratio of ij-band luminosity assigned by subhalo abundance matching to ij-band luminosity of SPH galaxies computed 
using a stellar population synthesis code in the SPHnw simulation (top) and SPHw simulation (bottom), with the solid curve standing 
for all galaxies and the dotted curve for satellite galaxies. 
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Figure 6. Relation between galaxy SFR and stellar mass (left), and accuracy of SHAM SFR assignment (right), for galaxies in the 
SPHnw (top) and SPHw (bottom) simulations at z = 2. The format is the same as Figures 2 and 5, but with instantaneous SFR used 
in place of stellar mass or ij-band luminosity. Only galaxies with stellar mass above the 64 mgpH threshold at z = 2 are included in the 
relations. 
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Figure 7. Mean number of galaxies per halo versus halo mass. The solid curve represents the SPHnw simulation, while the dashed and 
dotted curves are results from populating halos in our N-body simulation with galaxies using subhalo abundance matching (SHAM) with 
subhalo masses at Zsat and at z = (SHAMzO), respectively. Each panel stands for a different galaxy stellar mass threshold, for which 
the corresponding space density of galaxies is also indicated. Curves are truncated when the mean occupation of halos in the next lower 
(0.1-dex) mass bin falls below 0.03. 
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Figure 8. Mean number of galaxies per halo versus halo mass. This figure is analogous to figure [7] but using the SPHw simulation, 
which includes momentum driven winds. 
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solid curve represent SPH galaxies, while the dashed and dotted curves represent SHAM and SHAMzO respectively. All galaxies above 
the resolution threshold are included in this figure. 
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Figure 10. Radial number density profile of SPH galaxies (black solid curve) and halos in the N-body simulation populated with 
galaxies using SHAM (blue dashed curve) and SHAMzO (red dot-dashed curve), for the mass threshold of Mg = 5.8 X lO'^ Mq. The top 
two panels are for two halo mass bins in the SPHnw simulation while the bottom two panels are for two halo mass bins in the SPHw 
simulation. The curves stop when the only interior galaxy is the central galaxy. Also shown are radial number density profiles of SPH 
galaxies (gray solid curve) and SHAM (purple dashed curve) selected halos using a higher threshold of Ms = 7.3 xlO'^'^ Mq in the no 
winds simulation and Mg = 2.8 xlO^" Mq in the winds simulation. 
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Figure 11. Two-point correlation function of SPH galaxies (solid curve) and halos and subhalos in the N-body simulation populated 
using SHAM (dashed curve) and SHAMzO (dot-dashed curve). Panels (a) and (b) show all galaxies above the stellar mass resolution 
threshold (5.8 X 10® Mq) in the SPHnw (no winds) and SPHw (winds) simulations respectively. Panels (c) and (d) show galaxies above 
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